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RAW SEQUENCE LISTING 
ERROR REPORT 



BIO CHNOLOGY 
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The Biotechnology Systems Branch of the Scientific and Technical Information 
Center (STIC) detected errors when processing the following computer readable 
form: 



Application 



Serial Number: Ol (Ml H I A 



Source: 

Date Processed by STIC: 



THE ATTACHED PRINTOUT EXPLAINS DETECTED ERRORS. 

PLEASE FORWARD THIS INFORMATION TO THE APPLICANT BY EITHER: 

1) INCLUDING A COPY OF THIS PRINTOUT IN YOUR NEXT COMMUNICATION TO THE 

APPLICANT, WITH A NOTICE TO COMPLY or, 

2) TELEPHONING APPLICANT AND FAXING A COPY OF THIS PRINTOUT, WITH A 

NOTICE TO COMPLY 

FOR CRF SUBMISSION QUESTIONS, PLEASE CONTACT MARK SPENCER, 703-308-4212. 

FOR SEQUENCE RULES INTERPRETATION, PLEASE CONTACT ROBERT WAX, 703-308-4216. 
PATENTIN 2.1 e-mail help: natin21heln(S)uspto.gov or phone 703-306-4119 (R. Wax) 
PATENTIN 3.0 e-mail help: natin3help(5),uspto.gov or phone 703-306-4119 (R. Wax) 

TO REDUCE ERRORED SEQUENCE LISTINGS, PLEASE USE THE CHECKER 
VERSION 3.0 PROGRAM. ACCESSIBLE THROUGH THE U.S. PATENT AND 
TRADEMARK OFFICE WEBSITE. SEE BELOW: 



Checker Version 3.0 

The Checker Version 3.0 application is a state-of the-art Windows based software program 
employing a logical and intuitive user-interface to check whether a sequence listing is in 
compliance with format and content rules. Checker Version 3.0 works for sequence listings 
generated for the original version of 37 CFR §§1.821 - 1.825 effective October 1, 1990 (old 
rules) and the revised version (new rules) effective July 1, 1998 as well as World Intellectual 
Property Organization (WIPO) Standard ST.25. 

Checker Version 3 .0 replaces the previous DOS-based version of Checker, and is Y2K- 
compliant. Checker allows public users to check sequence listings in Computer Readable form 
(CRF) before submitting them to the United States Patent and Trademark Office (USPTO). 
Use of Checker prior to filing the sequence listing is expected to result in fewer errored sequence 
listings, thus saving time and money. 



Checker Version 3.0 can he down loaded from the USPTO webs ite at the following address: 

http://www.uspto.gov/web/offices/pac/checker 



Raw Sequence Listing Error Summary 



F.RROR DETECTED 

ATTN: NEW RULES CASES 

1 Wrapped Nuclcics 

Wrapped Aminos 



Invalid Line Length 

^Misaligned Amino 
Numbering 

Non-ASCII 



Variable Length 



Palentln 2.0 
"bug" 



Skipped Sequences 
(OLD RULES) 



_Skipped Sequences 
(NEW RULES) 




Use of n's or Xaa's 
(NEW RULES) 



10 Invalid <213> 

Response 



1 1 



Use of<220> 



12 



Palentln 2.0 
"bug" 



SERIAL NUMBER: 



SUCCESTF" CORRECTION 
: PLEASE DISRECARD ENGLISH "ALPHA" HEADERS, WHICH WERE INSERTED BY PTO SOFH 

The numberAext .t the end of««ch line "wrapped" down lo the next line. This may c«cur if your file 
was S in a word processor .IUr creating it Please adjust your nght marpn to .3; Ih.. wdl 
prevent "wrapping." 

The rules require that a line not exceed 72 characters in length, this includes white spaces. 

The numbering under each 5* amino acid is misaligned. Do not use tab codes between numbers; 
use apace characters, instead. 

The submitted file was not saved in ASCI](DOS) text, as required by the Sequence Rules: Please 
ensure your aubsequent submission Is saved In i ASCII text. 

Seouencefs) contain n's or Xaas representing more than one residue. Per Sequence Rules. 
Lch n .r X^only represent a .Ingle residue. Please present the maximum number of each 
residue rlv£ variable length and indicate in the <220>-<223> section that some may be rn.ss.ng. 

a «h..o» in P«t«.tln version~2.0 hascaused <Ke <220'>-<223>«ection to be missing from amind acW- "' .: 
JZLZZ* NormallyTpatentin wbuld.auiomatic.il y generate this section from the 

ZSZ ^^fidldl^^cid sequence. Please manually copy the ~te-n« <220>.<223> aecUon ,o 
LXequTnt amino acid sequence. This applies to the mandatory <220>-<223> sect.ons for 
Artificial or Unknown sequences. 

Sequence^) missing. If intentional, please insert the following .lines for each skipped sequence: 
rr> INFORMATION FOR SEQ ID NO:X: (insert SEQ ID NO where is shown) 
(2 INFORMATKK^rU CHA ^ ACTERISTICS: (Do nol insert lny subheadings under Ih.s headmg) 

(id) SEQUENCE DESCRIPTION : SEQ ID NO:X: (insert SEQ ID NO where "X" .s shown) 
This sequence is intentionally skipped 

Please also adjust the "(ii) NUMBER OF SEQUENCES:" response to Include the skipped sequences. 

Sequence(s) missing. If Intentional, please insert the following lines for each skipped sequence. 

<210> sequence id number 
<400> sequence id number 
000 

Use of n's and/or Xaa's have been detected in the Sequence Listing. 

p , Mm a Sequence Rules use of <220>-<223> is MANDATORY if n's or Xaa's are present. 
uiSSSS^^^ explain location of n or Xaa. and which residue „ or Xaa represents. 

o i «oi «fQ«,..«ice Rules the only vaUd <213> responses are: Unknown, Artificial Sequence, or 
l^SSSS^S^S <S>-<«3> sectionTs required when <2.3> response is Unknown or 
is Artificial Sequence 

Seauencrfs) missing the <220> "Feature" and associated.numeric identifiers and responses. 

iSSoV^SS is MANDATORY if <213> "Organism" response is "Art.fical Sequence or 
-iintnAwn " Please explain source of genetic material in <220> to <223> section. 
( ^«^ ^^1/1998, vIl63.No. .04. pp. 29631-32) (See. 1.823 ofSequence Rules) 

Please do not use "Copy to Disk" function of Palentln version 2.0. This causes a corrupted file. 
S :inrlussing mandatory numeric identifiers and responses (as indicated raw seque^c 
listing) Instead, please use "File Manager" or any other manual means to copy file to floppy d.sk. 



13 



Misuse of n 



n can only be used to represent a single nucleotide in a nucleic acid sequence. N is not used to represent 
any value not specifically a nucleotide. 

AMC/MH - Biotechnology Systems Branch -08/21/2001 
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OIPE 



RAW SEQUENCE LISTING DATE: 08/09/2001 

PATENT APPLICATION: US/09/847 r 081A TIME: 12:44:02 

Input Set : A:\Mo6314 sequence-TEXT.txt 
Output Set: N:\CRF3\08092001\I847081A.raw 

3 <110> APPLICANT: BAYER AG 

5 <120> TITLE OF INVENTION: DNA encoding the tobacco phytoene synthase 
7 <130> FILE REFERENCE: Le A 34 326 



C--> 9 <140> CURRENT APPLICATION NUMBER: US/09/847 f 081A 





<141> CURRENT FILING DATE: 


2001 


-05- 


02 


















12 


<160> NUMBER OF 


SEQ ID 


NOS: 


10 
























<170> SOFTWARE: 


Patentln Ver. 2 


.1 


















J- u 

17 
18 


<210> SEQ ID NO: 1 
<211> LENGTH: 1728 
<212> TYPE: DNA 


















"oes Not Comply 
Corrocted Diskette Needed 


1 Q 

21 


<213> ORGANISM: 
<220> FEATURE: 


Nicotiana tabacum 
















\ 


r 0 


22 


<221> NAME/KEY: 


CDS 






















r 

\ 






<222> LOCATION: 


(244). . 


(1566) 






















25 


<400> SEQUENCE: 


1 


























z 0 


agaaacccag 


aaagaacaac aggttttget tcttgttgat 


gagtgcattt 


gcctctgctt 


60 




9ft 
z o 


gtgtaaggca 


aagtcggttc actttcttat atccgatttt 


tataategtt 


gaaattagtg 


1 90 
-L A yj 




30 


gatagactct 


agtggatatc tacaagtatt ggttttttga 


taaaataggc 


tgaggtgaga 


180 




32 


aggtaacata 


aaggaaagac aaaaacttgg gaattgtttt 


agaccaccga 


ggtttcttgt 


940 




j *± 


ttc 


atg 


age 


atg 


tct gtt 


get 


ttg 


ttg 


tgg 


gtt 


gtt 


tct 


ccc 


act 


tec 


9ft ft 
zoo 




35 




Met 


Ser 


Met 


Ser Val 


Ala 


Leu 


Leu 


Trp 


Val 


Val 


Ser 


Pro 


Thr 


Ser 






•J o 




1 






5 










10 










15 






^ft 


gag 


gtc 


teg 


aat 


ggg aca 


gga 


ttg 


ttg 


gat 


tea 


gtc 


cga 


gaa 


gga 


aac 


336 




40 


Glu 


Val 


Ser 


Asn 


Gly Thr 


Gly Leu 


Leu 


Asp 


Ser 


Val 


Arg 


Glu 


Gly 


Asn 
















20 








25 










30 










cgc 


gtc 


ttt 


gta 


tea tec 


agg 


ttc 


eta 


get 


cga 


gat 


agg 


aat 


ttg 


atg 


384 




45 


Arg 


Val 


Phe 


Val 


Ser Ser 


Arg 


Phe 


Leu 


Ala 


Arg 


Asp 


Arg 


Asn 


Leu 


Met 






46 








35 








40 










45 










48 


tgg 


aat 


ggg 


aga 


ate aag 


aaa 


ggt 


ggg 


aga 


caa 


agg 


tgg 


aat 


ttt 


ggc 


432 




50 


Trp 


Asn 


Gly 


Arg 


He Lys 


Lys 


Gly 


Gly 


Arg 


Gin 


Arg 


Trp 


Asn 


Phe 


Gly 






51 






50 








55 










60 












53 


tct 


tta 


att 


get 


gat cca 


aga 


tat 


tea 


tgc 


ttg 


ggt 


gga 


tea 


aga 


act 


480 




56 


Ser 


Leu 


He 


Ala 


Asp Pro 


Arg 


Tyr 


Ser 


Cys 


Leu 


Gly 


Gly 


Ser 


Arg 


Thr 






57 




65 








70 










75 














59 


gaa 


aag 


gga 


age 


act ttc 


tct 


gta 


cag 


tec 


agt 


ttg 


gtg 


get 


age 


cca 


528 




61 


Glu 


Lys 


Gly 


Ser 


Thr Phe 


Ser 


Val 


Gin 


Ser 


Ser 


Leu 


Val 


Ala 


Ser 


Pro 






62 


80 








85 










90 










95 






64 


get 


gga 


gaa 


atg 


act gtg 


tea 


tea 


gag 


aaa 


aag 


gtg 


tat 


gat 


gtg 


gta 


576 




66 


Ala 


Gly 


Glu 


Met 


Thr Val 


Ser 


Ser 


Glu 


Lys 


Lys 


Val 


Tyr 


Asp 


Val 


Val 






67 










100" 








105 










-110 








69 


tta 


aag 


cag 


gca 


gct tta 


gtg 


aag 


agg 


cag 


ctg 


aga 


tct 


acc 


gat 


gat 


624 




71 


Leu 


Lys 


Gin 


Ala 


Ala Leu 


Val 


L YS 


Arg 


Gin 


Leu 


Arg 


Ser 


Thr 


Asp 


Asp 






72 








115 








120 










125 










74 


tta 


gaa 


gtg 


aag 


ccg gat 


att 


gtt 


gtt 


cca 


ggg 


aat 


ttg 


ggc 


ttg 


ttg 


672 




76 


Leu 


Glu 


Val 


Lys 


Pro Asp 


He 


Val 


Val 


Pro Gly Asn 


Leu 


Gly 


Leu 


Leu 






77 






130 








135 










140 












79 


agt 


gaa 


gca 


tat 


gat cgt 


tgt 


ggc 


gaa 


gta 


tgt 


gca 


gag 


tat 


gca 


aag 


720 
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RAW SEQUENCE LISTING DATE: 08/09/2001 

PATENT APPLICATION: US/09/84 7 , 081A TIME: 12:44:02 

Input Set : A:\Mo6314 sequence-TEXT.txt 
Output Set: N:\CRF3\08092001\I847081A.raw 



81 


Ser 


Glu 


Ala 


Tyr Asp Arg 


Cys 


oj_y 


tjlU 


vai 


Cys 


Ala 


GlU 


Tyr 


Ala 


Lys 




82 




145 






150 










155 








84 


aca 


ttt 


tac 


tta gga acc 


aag 


eta 


atg 


acc 


cca 


gag 


aga 


aga 


aga 


get 


768 


86 


Thr 


Phe 


Tyr Leu Gly Thr 


Lys 


Leu 


Met 


Thr 


Pro 


Glu 


Arg 


Arg 


Arg 


Ala 




87 


160 






165 










170 








175 




89 


ate 


tgg 


gca 


ata tat gtg 


tgg 


tgc 


agg 


aga 


acg 


gat 


gag 


ctt 


gtt 


gat 


816 


91 


He 


Trp 


Ala 


He Tyr Val 


Trp 


Cys 


Arg 


Arg 


Thr 


Asp 


Glu 


Leu 


Val 


Asp 




92 








180 








185 










190 




94 


ggc 


cct 


aat 


gca tec cac 


ata 


act 


ccg 


caa 


get 


tta 


gat 


agg 


tgg 


gag 


864 


96 


Gly 


Pro 


Asn 


Ala Ser His 


He 


Thr 


Pro 


Gin 


Ala 


Leu 


Asp 


Arg 


Trp 


Glu 




97 








195 






200 








205 






99 


acc 


agg 


ctg 


gaa gat att 


ttc 


agt 


ggg 


egg 


cca 


ttt 


gat 


atg 


Ctt" 


gat 


912 



101 Thr Arg Leu Glu Asp He Phe Ser Gly Arg Pro Phe Asp Met Leu Asp 

102 210 215 220 
104 get get tta tec gat act gtc tec aga ttt cct gtt gat att cag cca 960 

106 Ala Ala Leu Ser Asp Thr Val Ser Arg Phe Pro Val Asp He Gin Pro 

107 225 230 235 

109 ttc aga gat atg att gaa gga atg cgt atg gac ttg tgg aaa tec aga 1008 

111 Phe Arg Asp Met He Glu Gly Met Arg Met Asp Leu Trp Lys Ser Arg 

112 240 245 250 255 

114 tac aaa act ttc gat gag eta tat etc tat tgt tac tat gtt get ggt 1056 

116 Tyr Lys Thr Phe Asp Glu Leu Tyr Leu Tyr Cys Tyr Tyr Val Ala Gly 

117 260 265 270 

119 act gta gga ttg atg agt gtt cca gtt atg ggt att gca cct gaa tea 1104 

121 Thr Val Gly Leu Met Ser Val Pro Val Met Gly He Ala Pro Glu Ser 

122 275 280 285 

124 aag gca aca aca gag agt gta tat aat get get ttg get tta ggg ctt 1152 

126 Lys Ala Thr Thr Glu Ser Val Tyr Asn Ala Ala Leu Ala Leu Gly Leu 

127 290 295 300 

129 gca aat caa eta acc aat ata etc aga gat gta gga gaa gat gee aga 1200 

131 Ala Asn Gin Leu Thr Asn He Leu Arg Asp Val Gly Glu Asp Ala Arg 

132 305 310 315 

134 aga gga aga gta tac ttg cct caa gat gaa tta gca cag gca ggg etc 1248 

136 Arg Gly Arg Val Tyr Leu Pro Gin Asp Glu Leu Ala Gin Ala Gly Leu 

137 320 325 330 335 

139 tec gac gaa gac ata ttt get gga aga gtg act gat aag tgg agg aac 1296 

141 Ser Asp Glu Asp He Phe Ala Gly Arg Val Thr Asp Lys Trp Arg Asn 

142 340 345 350 

144 ttt atg aag aaa caa att cag agg gcg agg aaa ttc ttt gat gag tea 1344 

146 Phe Met Lys Lys Gin lie Gin Arg Ala Arg Lys Phe Phe Asp Glu Ser 

147 355 360 365 

149 gag aaa ggt gtc aca gaa ctg gac tct get agt aga tgg cct gtg tta 1392 
151 Glu Lys Gly Val Thr Glu Leu Asp Ser Ala Ser Arg Trp Pro Val Leu 
.152 , .370 -. . , . _ 375 . . _ m 380_ 

154 aca gcg ctg ctg ttg tat cgc aag ata ttg gac gag att gaa gee aac 1440 

156 Thr Ala Leu Leu Leu Tyr Arg Lys He Leu Asp Glu He Glu Ala Asn 

157 385 390 395 

159 gac tac aac aac ttc aca agg agg get tat gtt age aag cca aag aag 1488 
161 Asp Tyr Asn Asn Phe Thr Arg Arg Ala Tyr Val Ser Lys Pro Lys Lys 
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RAW SEQUENCE LISTING DATE: 08/09/2001 

PATENT APPLICATION: US/09/847, 081A TIME: 12:44:02 

Input Set : A:\Mo6314 sequence-TEXT.txt 
Output Set: N:\CRF3\08092001\I847081A.raw 

162 400 405 410 415 

164 ctt etc acc ttg ccc att get tat gca aaa tct ctt gtg ccc cct aat 1536 

166 Leu Leu Thr Leu Pro lie Ala Tyr Ala Lys Ser Leu Val Pro Pro Asn 

167 420 425 430 

169 aga act tec tct cca eta gca aag aca tga atgaagtagt tgagtcaatg 1586 

171 Arg Thr Ser Ser Pro Leu Ala Lys Thr 

172 435 440 

174 agtattatac actaaagaaa ctcaggtact tgtaaatgag atatcttttg ctaaatgtgt 1646 
176 atcatcaaaa gtagattgta aattcaatat gacaatctct tggtagaata ttttctccac 1706 
178 actcatcaaa ccctcaagtg ag 1728 

181 <210> SEQ ID NO: 2 

182 <211> LENGTH: 440 

183 <212> TYPE: PRT 

184 <213> ORGANISM: Nicotiana tabacum 

186 <400> SEQUENCE: 2 

187 Met Ser Met Ser Val Ala Leu Leu Trp Val Val Ser Pro Thr Ser Glu 

188 15 10 15 

190 Val Ser Asn Gly Thr Gly Leu Leu Asp Ser Val Arg Glu Gly Asn Arg 

191 20 25 30 

193 Val Phe Val Ser Ser Arg Phe Leu Ala Arg Asp Arg Asn Leu Met Trp 

194 35 40 45 

196 Asn Gly Arg He Lys Lys Gly Gly Arg Gin Arg Trp Asn Phe Gly Ser 

197 50 55 60 

199 Leu He Ala Asp Pro Arg Tyr Ser Cys Leu Gly Gly Ser Arg Thr Glu 

200 65 70 75 80 

202 Lys Gly Ser Thr Phe Ser Val Gin Ser Ser Leu Val Ala Ser Pro Ala 

203 85 90 95 

205 Gly Glu Met Thr Val Ser Ser Glu Lys Lys Val Tyr Asp Val Val Leu 

206 100 105 110 

208 Lys Gin Ala Ala Leu Val Lys Arg Gin Leu Arg Ser Thr Asp Asp Leu 

209 115 120 125 

211 Glu Val Lys Pro Asp He Val Val Pro Gly Asn Leu Gly Leu Leu Ser 

212 130 135 140 

214 Glu Ala Tyr Asp Arg Cys Gly Glu Val Cys Ala Glu Tyr Ala Lys Thr 

215 145 150 155 160 

217 Phe Tyr Leu Gly Thr Lys Leu Met Thr Pro Glu Arg Arg Arg Ala He 

218 165 170 175 

220 Trp Ala He Tyr Val Trp Cys Arg Arg Thr Asp Glu Leu Val Asp Gly 

221 180 185 190 

223 Pro Asn Ala Ser His He Thr Pro Gin Ala Leu Asp Arg Trp Glu Thr 

224 195 200 205 

226 Arg Leu Glu Asp He Phe Ser Gly Arg Pro Phe Asp Met Leu Asp Ala 

227 210 215 220 

J^SL.Ala^ Leu_ Ser Asp Thr Val Ser Arg Phe Pro Val Asp He Gin Pro Phe 
230 225 230 235 " 240 

232 Arg Asp Met He Glu Gly Met Arg Met Asp Leu Trp Lys Ser Arg Tyr 

233 245 250 255 

235 Lys Thr Phe Asp Glu Leu Tyr Leu Tyr Cys Tyr Tyr Val Ala Gly Thr 

236 260 265 270 
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RAW SEQUENCE LISTING 

PATENT APPLICATION: US/09/84 7 , 081A 



DATE: 08/09/2001 
TIME: 12:44:02 



Input Set : A:\Mo6314 sequence-TEXT.txt 
Output Set: N:\CRF3\08092001\I847081A.raw 



238 


Val 


Gly 


Leu 


Met 


Ser 


Val 


Pro Val 


Met 


Gly 


He 


Ala 


Pro 


Glu 


Ser 


Lys 


239 






275 








280 










285 








241 


Ala 


Thr 


Thr 


Glu 


Ser 


Val 


Tyr Asn 


Ala 


Ala 


Leu 


Ala 


Leu 


Gly 


Leu 


Ala 


242 




290 










295 








300 










244 


Asn 


Gin 


Leu 


Thr 


Asn 


He 


Leu Arg 


Asp 


Val 


Gly 


Glu 


Asp 


Ala 


Arg 


Arg 


245 


305 










310 








315 










320 


247 


Gly Arg 


Val 


Tyr 


Leu 


Pro 


Gin Asp 


Glu 


Leu 


Ala 


Gin 


Ala 


Gly 


Leu 


Ser 


248 










325 








330 










335 




250 


Asp 


Glu 


Asp 


He 


Phe 


Ala 


Gly Arg 


Val 


Thr 


Asp 


Lys 


Trp 


Arg 


Asn 


Phe 


251 








340 








345 










350 






253 


Met 


Lys 


Lys 


Gin 


He 


Gin 


Arg Ala 


Arg 


Lys 


Phe 


Phe 


Asp 


Glu 


Ser 


Glu 


254 






355 








360 










365 








256 


Lys 


Gly 


Val 


Thr 


Glu 


Leu 


Asp Ser 


Ala 


Ser 


Arg 


Trp 


Pro 


Val 


Leu 


Thr 


257 




370 










375 








380 










259 


Ala 


Leu 


Leu 


Leu 


Tyr 


Arg 


Lys He 


Leu 


Asp 


Glu 


He 


Glu 


Ala 


Asn 


Asp 


260 


385 










390 








395 










400 


262 


Tyr 


Asn 


Asn 


Phe 


Thr 


Arg 


Arg Ala 


Tyr 


Val 


Ser 


Lys 


Pro 


Lys 


Lys 


Leu 


263 










405 








410 










415 




265 


Leu 


Thr 


Leu 


Pro 


He 


Ala 


Tyr Ala 


Lys 


Ser 


Leu 


Val 


Pro 


Pro 


Asn 


Arg 


266 








420 








425 










430 






268 


Thr 


Ser 


Ser 


Pro 


Leu 


Ala 


Lys Thr 


















269 






435 








440 








\ 


x/A 









w- 
w- 



273 
274 
275 
276 
278 
279 
280 
282 
W--> 283 
284 
285 
286 
287 
289 
291 
293 
295 
297 
298 
299 
301 
303 
304 
306 
308 
309 
311 



<210> SEQ ID NO: 3 
<211> LENGTH: 1712 
<212> TYPE: DNA 
<213> ORGANISM: 
<220> FEATURE: 
<221> NAME/KEY: 
<222> LOCATION: 
<220> FEATURE: 



Nicotiana tabacum 



CDS 
(333) 




Xaa 



is unknown or other 



( 



(1565) 

<221> NAME/KEY: 

<222> LOCATION: 
<223> OTHER INFORMATION 
<400> SEQUENCE: 3 
cttgaagagt agcagcagca agcaagahaa ttaaagtggg ctatttbkkaf^n^agccattg 

ttacmagara attaagaagc caagamacag gttattttct acttgagtya ggaaaagttg 
gtttgcttta tttgtgggct ttttataatc ttttttccac aagggaaagt gggtattttc 
ttgaaagtgg atttagactc tagtgggaat ctactaggag taaatttatt aattttttat 
aaattaagca gaggaaggaa ggaaacagaa aacagaaagt aagacaaaaa accttggaat 
tgttttagaa agccaaggtt ttcctgttca aa atg tct gtt gcc ttg tta tgg 

Met Ser Val Ala Leu Leu Trp 
1 5 
gtt gtt tea cct tgt gaa gtc tea aat ggg aca gga ttc ttg gat tea 
Val Val Ser Pro Cys Glu Val Ser Asn Gly Thr Gly Phe Leu Asp Ser 

10 15 "~ " " ~~" 20 

gtc egg gag gga aac egg gtt ttt gat teg teg agg cat agg aat tta 
Val Arg Glu Gly Asn Arg Val Phe Asp Ser Ser Arg His Arg Asn Leu 

25 30 35 

gtg tgc aat gag aga aac aag aga ggt gtg aaa caa agg tgg aat ttt 



5 v-r e 



60 

120 
180 
240 
300 
353 



401 



449 



497 
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RAW SEQUENCE LISTING DATE : 08/09/2001 

PATENT APPLICATION: US/09/847, 081A TIME: 12:44:02 

Input Set : A:\Mo6314 sequence-TEXT.txt 
Output Set: N:\CRF3\08092001\I847081A.raw 

313 Val Cys Asn Glu Arg Asn Lys Arg Gly Val Lys Gin Arg Trp Asn Phe 

314 40 45 50 55 

316 ggt tct gta agg tct get atg gtg get aca ccg gcg gga gaa atg gcg 545 

318 Gly Ser Val Arg Ser Ala Met Val Ala Thr Pro Ala Gly Glu Met Ala 

319 60 65 70 

321 acg atg aca tea gaa cag atg gtt tat gat gtg gtt tta aaa caa gea 593 

323 Thr Met Thr Ser Glu Gin Met Val Tyr Asp Val Val Leu Lys Gin Ala 

324 75 80 85 

326 get tta gtg aag agg cag ttg aga tct get gat gat tta gaa gtg aag 641 

328 Ala Leu Val Lys Arg Gin Leu Arg Ser Ala Asp Asp Leu Glu Val Lys 

329 90 95 100 

331 ccg gag ate cct etc ccc ggg aat ttg age ttg ttg agt gaa gca tat 689 

333 Pro Glu lie Pro Leu Pro Gly Asn Leu Ser Leu Leu Ser Glu Ala Tyr 

334 105 110 115 

336 gat agg tgt agt gaa gta tgt gca gag tat gca aag aca ttt tac tth 737 
W--> 33 8 Asp Arg Cys Ser Glu Val Cys Ala Glu Tyr Ala Lys Thr Phe Tyr Xaa 
339 120 125 130 135 

341 gga ace atg yta atg act eca gag aga aga agg get att tgg gca ata 785 
W--> 34 3 Gly Thr Met Xaa Met Thr Pro Glu Arg Arg Arg Ala lie Trp Ala lie 
344 140 145 150 

34 6 tat gtg tgg tgc agg aga aca gat gaa ctt gtt gat ggc eca aac gca 833 

348 Tyr Val Trp Cys Arg Arg Thr Asp Glu Leu Val Asp Gly Pro Asn Ala 

349 155 160 165 

351 tea cat att aca ccc caa gee tta gat agg tgg gaa gac egg ctt gaa 881 

353 Ser His He Thr Pro Gin Ala Leu Asp Arg Trp Glu Asp Arg Leu Glu 

354 170 175 180 

3 56 gat gtt ttc age ggg cga eca ttt gat atg etc gat get get ttg tec 929 

358 Asp Val Phe Ser Gly Arg Pro Phe Asp Met Leu Asp Ala Ala Leu Ser 

359 185 190 195 

3 61 gat act gtt tec aag ttt eca gtt gat att cag ccg ttc aga gat atg 977 

363 Asp Thr Val Ser Lys Phe Pro Val Asp He Gin Pro Phe Arg Asp Met 

364 200 205 210 215 

366 att gaa gga' atg cgt atg gac ttg agg aag tea aga tat aga aac ttt 1025 

368 He Glu Gly Met Arg Met Asp Leu Arg Lys Ser Arg Tyr Arg Asn Phe 

369 220 225 230 

371 gat gag ctt tac etc tat tgt tat tac gtt get ggt acg gtt ggg ttg 1073 

373 Asp Glu Leu Tyr Leu Tyr Cys Tyr Tyr Val Ala Gly Thr Val Gly Leu 

374 235 240 245 

3 76 atg agt gtt eca att atg ggt att gca cct gat tea aag gca aca aca 1121 

378 Met Ser Val Pro He Met Gly He Ala Pro Asp Ser Lys Ala Thr Thr 

379 250 255 260 

381 gag age gta tat aat gca get ttg get tta gga ate gca aat caa eta 1169 

383 Glu Ser Val Tyr Asn Ala Ala Leu Ala Leu Gly He Ala Asn Gin Leu 

384 265 _ 270 275 

386 acg aac ata etc aga gat gtt gga gaa gat gee aga aga gga aga gtc 1217 

388 Thr Asn He Leu Arg Asp Val Gly Glu Asp Ala Arg Arg Gly Arg Val 

389 280 285 290 ' 295 

391 tac tta cct caa gat gaa tta gca cag gca ggt etc ttc gac gat gac 1265 
393 Tyr Leu Pro Gin Asp Glu Leu Ala Gin Ala Gly Leu Phe Asp Asp Asp 
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VERIFICATION SUMMARY 

PATENT APPLICATION: US/09/847, 081A 



DATE: 08/09/2001 
TIME: 12:44:04 



Input Set : A:\Mo6314 sequence-TEXT.txt 
Output Set: N:\CRF3\08092001\I847081A.raw 



L:9 M:270 C: Current Application Number differs, Replaced Application Number 

L:10 M:271 C: Current Filing Date differs, Replaced Current Filing Date 

L:283 M:257 W: Feature value mis-spelled or invalid, <221> Name/Key for SEQ ID# : 3 

L:286 M:283 W: Missing Blank Line separator, <400> field identifier 

L:287 M:341 W: (46) "n" or "Xaa" used, for SEQ ID# : 3 

L:338 M:341 W: (46) "n" or "Xaa" used, for SEQ ID# : 3 

L:343 M:341 W: (46) "n" or "Xaa" used, for SEQ ID# : 3 

L:442 M:257 W: Feature value mis-spelled or invalid, <221> Name/Key for SEQ ID# : 4 
L:472 M:341 W: (46) "n" or "Xaa" used, for SEQ ID# : 4 

L:892 M:334 W: (2) Invalid Amino Acid in Coding Region, NUMBER OF INVALID KEYS : 5 
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